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Abstract 

Access graphs, which have been used previously in connection with 
competitive analysis and relative worst order analysis to model local- 
ity of reference in paging, are considered in connection with relative 
interval analysis. The algorithms LRU, FIFO, FWF, and FAR are 
compared using the path, star, and cycle access graphs. In this model, 
some of the expected results are obtained. However, although LRU is 
found to be strictly better than FIFO on paths, it has worse perfor- 
mance on stars, cycles, and complete graphs, in this model. We solve 
an open question from [Dorrigiv, Lopez-Ortiz, Munro, 2009], obtaining 
tight bounds on the relationship between LRU and FIFO with relative 
interval analysis. 

1 Introduction 

The paging problem is the problem of maintaining a subset of a potentially 
very large set of pages from memory in a significantly smaller cache. When 
a page is requested, it may already be in cache (called a "hit"), or it must 
be brought into cache (called a "fault" ) . The algorithmic problem is the one 
of choosing an eviction strategy, i.e., which page to evict from cache in the 
case of a fault, with the objective of minimizing the total number of faults. 



*A preliminary version of this paper will appear in the proceedings of the Thirteenth 
Algorithms and Data Structures Symposium. Supported in part by the Danish Council 
for Independent Research. Part of this work was carried out while the first and third 
authors were visiting the University of Waterloo. 
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Many different paging algorithms have been considered in the literature, 
many of which can be found in [3j [12] . Among the best known are LRU 
(least-recently-used), which always evicts the least recently used page, and 
FIFO (first-in-first-out), which evicts pages in the order they entered the 
cache. We also consider a known bad algorithm, FWF (flush-when- full) , 
which is often used for reference, since quality measures ought to be able 
to determine at the very least that it is worse than the other algorithms. If 
FWF encounters a fault with a full cache, it empties its cache, and brings 
the new page in. Finally, we consider a more involved algorithm, FAR, 
which works with respect to a known access graph. Whenever a page is 
requested, it is marked. When it is necessary to evict a page, it always 
evicts an unmarked page. If all pages are marked in such a situation, FAR 
first unmarks all pages. The unmarked page it chooses to evict is the one 
farthest from any marked page in the access graph. For breaking possible 
ties, we assume the LRU strategy in this paper. 

Understanding differences in paging algorithms' behavior under various cir- 
cumstances has been a topic for much research. The most standard measure 
of quality of an online algorithm, competitive analysis [17} I14j. cannot di- 
rectly distinguish between most of them. It deems LRU, FIFO, and FWF 
equivalent, with a competitive ratio of k, where k denotes the size of the 
cache. Other measures, such as relative worst order analysis [5j [6], can be 
used to obtain more separations, including that LRU and FIFO are bet- 
ter than FWF and that look-ahead helps. No techniques have been able 
to separate LRU and FIFO, without adding some modelling of locality of 
reference. 

Although LRU performs better than FIFO in some practical situations |18| . 
if one considers all sequences of length n for any n, bijective/average analysis 
shows that their average number of faults on these sequences is identical [2] , 
which basically follows from LRU and FIFO being demand paging algo- 
rithms. Thus, it is not surprising that some assumptions involving locality 
of reference are necessary to separate them. 

A separation between FIFO and LRU was established quite early using 
access graphs for modelling locality of reference [9], showing that under 
competitive analysis, no matter which access graph one restricts to, LRU 
always does at least as well as FIFO. This proved a conjecture in [4j, where 
the access graph model was introduced. Another way to restrict the input 
sequences was investigated in [lj. Using Denning's working set model [lOHllj 
as an inspiration, sequences were limited with regards to the number of 
distinct pages in a sliding window of size k. This also favors LRU, as does 
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bijective analysis [2j, using the same locality of reference definition as pQ. 
There has also been work in the direction of probabilistic models, including 
the diffuse adversary model [16] and Markov chain based models [15]. 

The earlier successes and the generality of access graphs, together with the 
possibilities the model offers with regards to investigating specific access 
patterns, makes it an interesting object for further studies. In the light of 
the recent focus on development of new performance measures, together with 
the comparative studies initiated in [H] , exploring access graphs results in the 
context of new performance measures seems like a promising direction for 
expanding our understanding of performance measures as well as concrete 
algorithms. 

One step in that direction was carried out in [7] , where more nuanced results 
were demonstrated, showing that restricting input sequences using the access 
graph model, while applying relative worst order analysis, LRU is strictly 
better than FIFO on paths and cycles. The question as to whether or 
not LRU is at least as good as FIFO on all finite graphs was left as an 
open problem, but it was shown that there exists a family of graphs which 
grows with the length of the corresponding request sequence, where LRU and 
FIFO are incomparable. Since LRU is optimal on paths, it is not surprising 
that both competitive analysis and relative worst order analysis find that 
LRU is better than FIFO on paths. Any "reasonable" analysis technique 
should give this result. Under competitive analysis, LRU and FIFO are 
equivalent on cycles. The separation by relative worst order analysis occurs 
because cycles contain paths, LRU is better on paths, and relative worst 
order analysis can reflect this. The fact that there exists an infinite family 
of graphs which grows with the length of the sequence where LRU and FIFO 
are incomparable may or may not be interesting. There are many sequences 
were FIFO is better than LRU; they just seem to occur less often in real 
applications. 

Comparing two algorithms under almost any analysis technique is generally 
equivalent to considering them with the complete graph as an access graph, 
since the complete graph does not restrict the request sequence in any way. 
Thus, LRU and FIFO are equivalent on complete graphs under both com- 
petitive analysis and relative worst order analysis, since they are equivalent 
without considering access graphs. 

In this paper, we consider relative interval analysis [13] . In some ways rela- 
tive interval analysis is between competitive analysis and relative worst order 
analysis. As with relative worst order analysis, two algorithms are compared 
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directly to each other, rather than compared to OPT. This gives the ad- 
vantage that, when one algorithm dominates another in the sense that it is 
at least as good as the other on every request sequence and better on some, 
the analysis will reflect this. However, it is similar to competitive analysis in 
that the two algorithms are always compared on exactly the same sequence. 
To compare two algorithms, LRU and FIFO for example, one considers the 
difference between LRU's and FIFO's performance on any sequence, divided 
by the length of that sequence. The range that these ratios can take is the 
"interval" for that pair of algorithms. For FIFO and LRU, |13j found two 
families of sequences /„ and J n such that linv^oo FIF °( J ")~ LRU ( 7 ") _ I 
and lining oo FIFQ ( J ")^ LRU(J n ) _ 1 _ __L__ They left it as an open problem 
to determine if worse sequences exist, making the interval even larger. In 
their notation, they proved: [-1 + - -^] C X(FIFO,LRU). We start 
by proving that this is tight: Z(FIFO, LRU) = [-1 + - -g^]. These 
results would be interpreted as saying that FIFO has better performance 
than LRU, since the absolute value of the minimum value in the interval is 
larger than the maximum, but also that they have different strengths, since 
zero is contained in the interior of the interval. We obtain more nuanced 
results by considering various types of access graphs, such as paths (Pn), 
stars (S]\[), and cycles (Cat), splitting the interval of [—1 + ^, \ — 4fc ] _ 2 ] into 
subintervals for the respective graph classes. Considering complete graphs 
(or cliques) implies that there are no restrictions on the input sequences, 
so this is equivalent to considering the situation without an access graph. 
Table [U shows our results. 

Comparing these results with the results from competitive analysis and rel- 
ative worst order analysis, both with respect to access graphs, it becomes 
clear that different measures highlight different aspects of the algorithms. 
All the measures show that LRU is strictly best on paths, which is not sur- 
prising since it is in fact optimal on paths and FIFO is not. On the other 
access graphs considered here, relative interval analysis gives results which 
can be interpreted as incomparability, but leaning towards deeming FIFO 
the better algorithm. Relative worst order analysis, on the other hand, 
shows that on cycles, LRU is strictly better than FIFO, and on complete 
graphs, they are equivalent. It has not yet been studied on stars, but an 
incomparability result for LRU and FIFO has been found for a family of 
graphs growing with the length of the input. 
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Table 1: Summary of Results 



Lower Bound Relative Interval Upper Bound 


Th. 


Z[FIFO,LRU] = 
I[FWF, A] 
[0, C I[FWF,FIFO] C 


1+ 1 1 1 

' k ' 2 4fc-2 

[o.i-i] 




m 

m 
i 


l p »[FIFO,A] = [0, 
T p x [FWF, A] = [0, 1 - i] 
[0, l-^ 1 ] C Z p « [FWF, FIFO] C [0, 


SI 

m 

El 


[-1+0(1), 1+0(1)] C X^[FIFO,.A] c [-1 + 6(1), i + 9(1)] 

I s ™ [FWF, B] = [0, |] 


El 
El 


[- 1 + f » 1 - 4^2 J £ I FIF0 > LRU] c 

I Cn [FWF, LRU] = 
[ '■(K^J- 1 )^ x P l c xC N[LRUjFAR] c 

[0, 1-^P] C I C «[FWF,FAR] C 
[0, 1 - C X Cn [FWF, FIFO] C 


i + 1 1 1 

A T fe ■ 2 4fc-2 

[o.i-i] 
[-^ i-i] 

[o, i - i] 
[o, i - i] 






El 

El 

El 
El 



A £ {FAR, LRU} and B £ {FAR, FIFO, LRU}. 
N = k + r, with 1 < r < k - 1, X r = r(x - 1) + [^r] with x = [log . 
N denotes N if N is even, and N — 1 otherwise. 



2 Preliminaries 

We have denned the paging algorithms in the introduction. If more detail 
is desired, the algorithms are described in [3]. 

An access graph for paging models the access patterns, i.e., which pages can 
be requested after a given page. Thus, the vertices are pages, and after a 
page p has been requested, the next request is to p or one of its neighbors 
in the access graph. We let N denote the number of vertices of the access 
graph under consideration at a given time. This is the same as the number 
of different pages we consider. We will always assume that N > k, since 
otherwise the problem is trivial, and let r = N — k. A requests sequence is a 
sequence of pages and the sequence respects a given access graph if any two 
consecutive requests are either identical or neighbors in the access graph. 
We let C(G) denote the set of all request sequences respecting G. 

We use the definition of fc-phases from [3] : 
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Definition 1 A request sequence can be divided recursively into a number 
of k-phases as follows: Phase is the empty sequence. For every i > 1, 
Phase i is a maximal sequence following Phase i — 1 containing at most k 



Thus, Phase i begins on the (k + l)st distinct page requested since the start 
of Phase i — and the last phase may contain fewer than k different pages. 
We generally want to ignore Phase 0, and refer to Phase 1 as the first phase. 

Similarly, we can define x-blocks, for some integer x, focusing on when a 
given algorithm A has faulted x times. 

Definition 2 A request sequence can be divided recursively into a number 
of x-blocks with respect to an algorithm A as follows: The 0th x-block is 
the empty sequence. For every i > 1, the zth x-block is a maximal sequence 
following the (i — l)st x-block for which A faults at most x times. 

The complete blocks are defined to be the ones with x faults, i.e., excluding 
the 0th block and possibly the last. □ 

There are some well-known and important classifications of paging algo- 
rithms, which are used here and in most other papers on paging [;3j: An 
paging algorithm is called conservative if it incurs at most k page faults 
on any consecutive subsequence of the input containing k or fewer distinct 
page references. LRU and FIFO belong to this class. Similarly, a paging 
algorithm is called a marking algorithm if for any A:-phase, once a page has 
been requested in that phase, it is not evicted for the duration of that phase. 
LRU, FAR, and FWF are marking algorithms. 

If A is a paging algorithm, we let A(I) denote „4's cost (number of faults) 
on the input (request) sequence /. We now adapt relative interval analysis 
from [13| to access graphs. Let A and B be two algorithms. We define the 
following notation: 



distinct requests. 



□ 



Min AB (n,G) 



min 

I\=n,IeC(G) 



B(I)} 



Max AjB (n,G) 



max \A(I) 

\I\=n,ieC(G) 



B(I)} 



Mm G (A,B) = lim inf 



Mm AB {n,G) 



n->oo 



n 
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M a , G (A,B) = lim sup MaX ^ (n ' G) 

n->oo n 

Definition 3 The relative interval of two algorithms A and B with respect 
to the access graph, G, is 

1 G {A,B) = [Mm G (A,B),Max G (A,B)} 

B has better performance than A if Max G (.4, B) > |Min G (^l, B)\. 

B dominates A HX G {A,B) = [0,(3] for some /? > 0. □ 

Note that in the above, Max G (A,B) = -Min G (^,^). 

This definition generalizes the one from [13] in that the original definition is 
the special case where G is the complete graph, which is the same as saying 
that there are no restrictions on the sequences. We omit G in the notation 
when G is complete. 

Note that if B dominates A, this means that A does not outperform B 
on any sequence (asymptotically), while there are sequences on which B 
outperforms A. Also, when Max G („4, B) is close to 0, this indicates that A's 
performance is not much worse than that of £>'s. 

The following general lemmas will prove helpful later. The first observation 
is well known for fc-phases [3]: 

Lemma 1 Any algorithm has at least b+k—1 faults on a sequence consisting 
of b complete fc-phases or b complete fc-blocks defined with respect to any 
conservative or marking algorithm. 

Proof Let p be the page requested first in Phase i and let I' be the subse- 
quence starting with the second request in Phase i and ending right after the 
first request in Phase i + 1. Since there are k different pages in /' different 
from p, and p is in cache right after it has been processed, any algorithm 
must fault at least once in Thus, an algorithm must fault at least k + 1 
times on Phase 1 and the first request in Phase 2, and then at least once for 
the next 6 — 2 A>phases, summing to b + k — 1. 

The only properties used in the above are the following: First, there are at 
least k distinct requests in a fc-phase, and, second, for any phase, the first 
request is different from any request in the previous phase; specifically, the 
first request in two subsequent fc-phases are different. Any conservative or 
marking algorithm gives rise to such /c-blocks. □ 
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Lemma 2 Assume that for two algorithms A and B, there exist functions 
/ and g such that 



• lim n ^ 00 Max^ jB (n,G) = oo, 

• for all / € C{G), A{I)-B{I) < /(6j) and \I\ > g(bi), where 6/ denotes 
the number of complete fc-phases or /c-blocks in /, and the /c-blocks 
are defined with respect to a conservative or marking algorithm, and 

• the limit lim^oo exists. 
Then Max G („4,£) < liim,^ gg. 

Proof In this proof, we will take the word "phase" to mean either a /c-phase 
or a /c-block. 

We define a sequence of request sequences as follows. For j > 1, let Ij be a 
sequence of length j such that Ij maximizes A(I) — B(I) over all sequences 
of length j. 

By construction, Max_4 i g(n,G) = max\n = jj e £/Q){A(I) — £>(/)} = A(Ij) — 
B(I 3 ) < /(&,,), and by assumption, \IA > s (6j.). Thus Max ^"' G > < ^y. 
Now, 



Max G (^, S) < limsup = hmsup 4^ = 1™ 4!r- 

i^oo ffCO/J b^oo b->oo g(b) 

The second to last equality holds since {&/. j j > 1} contains infinitely many 
values. Assume to the contrary that it had a maximum value bj., for some j'. 
That would mean that for any j, Max^d-Zj |, G) < max{/(6) | 1 < 6 < &/.,}, 
contradicting the assumption of the left-hand expression being unbounded. 

The last equality holds since we have assumed that the limit exists. □ 

The proof of the following is analogous to the lemma just proven. Note, 
however, that the function / in the second bullet has image in M~. 

Lemma 3 Assume that for two algorithms A and B, there exist functions 
/ and g such that 

• hm n ^oo Min^s (n,G) = -00, 
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• for all I € £(G), A{I)-B{I) > and |/[ > g(bi), where 6/ denotes 
the number of complete /c-phases or £>block in /, and the /c-blocks are 
defined with respect to a conservative or marking algorithm, and 

• the limit lim^oo exists. 
Then Min G (A B) > lim^ gg. 



3 Complete Graphs 

As remarked earlier, if the access graph is complete, it incurs no restrictions, 
so the result of this section is in the same model as |13| . In |13j , it is shown 
that [— 2fc~\ ] — 2^(FIFO,LRU). Below, we answer an open question 
from [13], proving that this is tight. 

Lemma 4 For any access graph G, 

-1 + t < Min G (FIFO, LRU) and Max G (FIFO, LRU) < - - — - — . 
k 2 4/l 2 

Proof We first consider the Min value. Suppose that a sequence / has 
b complete fc-phases. Since LRU is conservative and a complete /c-phase 
contains k distinct pages, it cannot fault more than bk + k — 1 times [3]. By 
Lemma H FIFO(I) > k + b — 1. Thus, FIFO(I) - LRU(I) > k + b-1- 
(bk + k — 1) = — b{k — 1). Each fc-phase must have length at least k, and 
lim^oo = -Y- Clear1 ^ min| 7 | =ni/e£(G) {FIFO(/) - LRU(/)} goes 

towards — oo as a function of n (see for instance the family of sequences J n 
from Lemma ED . Thus, by Lemma El Min G (FIFO, LRU) > = + 

We now consider the Max value. Given a request sequence /, we let -Bj 
denote the ith fc-block for FIFO. Assume that there are b complete k- 
blocks. FIFO faults k times per complete /c-block and up to k — 1 times for 
the possible final /c-block. Thus, FIFO(I) <bk + (k- 1). Assume that LRU 
faults ai times in Bi. By Lemma HJ LRU faults at least b + k — 1 times. 
Thus, Z b i=1 ai >b + k-l. 

We now compute a lower bound on the length of the request sequence / based 
on the number of complete /c-blocks in it and the algorithms' behavior on 
it. 

As a first step, with every request on which FIFO faults and LRU has a 
hit, we associate a distinct request where FIFO has a hit. Let r be such 
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a request to a page p in B^. Since it is a hit for LRU, p must have been 
requested in the maximal subsequence of requests I' consisting of k distinct 
pages and ending just before r. Consider the first such request, r', in I'. If 
it were a fault for FIFO, FIFO could not have faulted again on r. Thus, r' 
was a hit for FIFO and we associate r' with r. 

To establish that the association is distinct, assume that r' also gets associ- 
ated with a request r" . Without loss of generality, assume that r" is later 
than r. For FIFO to fault on both r and r", there must be at least k distinct 
pages different from p in between r and r" . However, since we are assuming 
that LRU has a hit on r", by the property of LRU, the page requested by 
r" must have been requested during the same k distinct pages. Thus, by 
the construction above, the page that gets associated with r" (and r) will 
be later than r, which is a contradiction. 

Thus, if LRU faults on times in by the procedure above, we identify at 
least k — ati distinct requests. In total, there are at least H\ =l (k — at) = 
bk — S^ =1 aj distinct hits for FIFO in I and, since there are b complete k- 
blocks, at least bk faults. Thus, the length of I is at least 2bk — Y*\ =l oii, 
and 



FIFO (J) - LRU(J) bk + k - 1 - S 



b 



\I\ ~ 2bk - T, b i=1 ai 

] 6 

< ^Ef ; if u < v and x < y < v , we have that 
bk + k — 1 — T, b i=1 ai bk + k - 1 - (6 + k - 1) _ b(k - 1) 



By the lower bound on ££ =1 aj above, and the arithmetic observation that 

u — y 
v-y 



2bk-Y? i=1 ai ~ 2bk - (b + k - 1) b(2k - 1) - k + 1 

Clearly, maxm =n! j e £((3){FIFO(I) — LRU(/))} is unbounded as a function of 
n (see for instance the family of sequences I n in Lemma [9]) . By Lemma [21 
Max G (FIFO, LRU) < = J _ _L_, since Hm^^ fe(2 ^ fc+1 = 

□ 



From |13j and Lemma HI we have the following: 
Theorem 1 Z(FIFO,LRU) = [-1 + | - 

The following gives general bounds that are applicable to all pairs of al- 
gorithms considered here, though in many cases better bounds are proven 
later. The proof was essentially given in the first paragraph of the proof of 
Lemma [31 
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Proposition 1 Let A be a conservative or marking algorithm and B be any 
algorithm for paging, then for any access graph G, Mm G [B,A] > — 1 + ^ 
and Max G [A,B] < 1 - ± 



3.1 FWF 



FWF performs very badly compared to the other algorithms considered here, 
LRU, FAR, and FIFO. The following is folklore: 

Lemma 5 For any sequence / and any conservative or marking algorithm 
A, we have A{I) < FWF (I). 

This implies that for any access graph G, A G (I) < FWF (I) and so 

Min G [FWF, LRU] = Min G [FWF, FIFO] = Min G [FWF, FAR] = 0. 
Thus, LRU, FIFO, and FAR all dominate FWF. 

The upper bound of 1 — \ from Proposition CD is tight for FWF versus either 
LRU, for any access graph containing a path on k + 1 vertices, and it is tight 
for FWF versus FAR on a path containing at least k + 1 vertices. Note that 
a cycle on k + 1 vertices contains a path on k + 1 vertices, but FAR does 
not behave identically on these two graphs. 



Theorem 2 For the path access graph P/v, where N > k + 1 (and for LRU 
for any graph containing Pfc+i), and A £ {LRU, FAR}, 



1 Pn [FWF,A] 



Proof Consider the sequence I n = (1, 2, . . . , k, k + 1, k, . . . , 2) n . For this we 
have LRU(I n ) = FAR(/ n ) = 2n + k - 1, and FWF(7 n ) = 2kn. Therefore, 

,. FWF(/„) - LRU(/„) .. FWF(/„) - FAR(/ n ) k - 1 
lim — — = lim r— : = — : — . 

n— >oo J n n— >oo / n k 

By Proposition CQ this gives Max Pjv (FWF, LRU) = Max Pjv (FWF, FAR) = 
1 — 4. Lemma [5] shows that LRU and FAR dominate FWF. □ 



The same tight result for FWF versus FIFO almost holds. 
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Theorem 3 For any graph G containing a path with k + 1 vertices, if k is 
odd, then 



Z G [FWF, FIFO] 



0,1 



and if k is even, then 

k 2 - k - 1 



0, 



k 2 



CZ G [FWF, FIFO] C 



0, 



fc-1 



Proof Let h = [(k + l)/2j. Define the subsequence 

Si = {h + i, h + i — 1, h, h — i, h — i + 1, h, h + i) 

and define the subsequence R which starts with page h and then requests 
Si, £2, ...j Sh-i- This initial part of every sequence in our family of sequences 
ensures that FIFO's order for faulting is always (h, h + 1, h — 1, h + 2, h — 
2, 2h — 1, 1). The value 2/i — 1 is k if is odd and k — 1 if k is even. 

Suppose A; is odd. Let /„ = (R, K n ), where J = (k + 1, k, 1, 2, .../c}' 1 and 
K n = J n . FWF and FIFO fault the same number of times on R. FWF faults 
2khn times on K n . On the first request to k + 1 in I n , FIFO evicts h. Thus, 
after the fault on k + 1, its only fault while going "left" (towards lower page 
numbers) for the first time in J is on h, and its only fault going "right" is on 
h + 1. On the ith iteration (i < h—1) of J, it faults on h—i + 1 going left and 
on h+i going right. On iteration h, it only faults on 1, so FIFO has the same 
cache configuration immediately after J as it had immediately before. Thus, 
FIFO has k + 1 faults on J, giving (k + l)n in all. The number of requests 



in K n is 2kn. Thus, limn 



oc 



FWF(J n )-FIFO(7„) 



2khn—{k+l)n 
2khn 



k ■ 



Suppose k is even. We define similar sequences, but let I n = (R,k,K n ), 
since k is not requested yet. FIFO will still fault k + 1 times on J, but 



lim n 



00- 



FWF(/ n ) - FIFO(J n ) 2khn - (k + l)n k 2 - k - 1 



2khn 



k 2 



Lemma [5] shows that FIFO dominates FWF. 



□ 



4 Path Graphs 

In this section, we analyze path access graphs, P/v, with N vertices. We 
assume that N > k + 1, since otherwise, results become trivial. 
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Lemma 6 For the path access graph Pjv, 

Max Pjv (FIFO, LRU) < - - — . 

2 2/l 

Proof Consider any request sequence /. We divide the sequence up into 
phases as described now (these are not fc-phases) . Initially, define a direction 
by where LRU makes its kth fault compared with its cache content. Without 
loss of generality, we assume this happens going to the right on the path. 

We start the first phase with the first request and later explain how subse- 
quent phases are started. In all the phases, we start to the left (relatively). 
In all phases, except the first, LRU has the first k — 1 distinct pages that 
will be requested during that phase in cache. In all phases, the first fault by 
LRU in the phase, after having processed the first k — 1 distinct pages, is 
to the right. We maintain this as an invariant that holds at the start of any 
phase, though the direction can change, as we will get back to at the end of 
the proof. The exception in the first phase, adding an extra k — 1 faults to 
the cost of LRU as compared with the analysis below, will not influence the 
result in the the limit for the length of the request sequence going towards 
infinity. 

We want to analyze a phase where LRU faults to the right before it faults 
to the left again. These faults to the right may not appear consecutively. 
There may be some faults in a row, but then there may be hits and then 
faults again, etc. Thus, assume that there are m maximal subsequences 
of requests to the right where LRU faults — all of this before LRU faults 
going to the left again. Assume further that these maximal subsequences of 
requests give rise to si, S2, ■ ■ ■ , s m faults, respectively, where, by definition, 
m > 1, and let s = E™ 1 Sj. 



k+t 



left 
H 



Si 
H 1- 



SlS 2 
H 1 1- 



S m bright 



For now, we assume that for all i, Si < k. Thus, LRU moves left and right at 
least m times; maybe more times where it does not give rise to faults. Since 
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it does not fault going to the left during these turns, the faults are to pages 
further and further to the right. Let -E^ght denote the extreme rightmost 
position it reaches during these faults to the right. 

When LRU faults again to the left after having processed bright; we consider 
the leftmost node -Ei e ft, where LRU faults after the s faults described above, 
but before it faults to the right again. We end the phase with the first 
request to E\ e f t after the s faults. We define subsequent phases inductively 
in the same way, starting with the first request not included in the previous 
phase, possibly leaving an incomplete phase at the end. 

We now consider the costs of the algorithms and the length of the sequence 
per phase. LRU faults s times going to the right during the m turns in the 
phase. Additionally, LRU must fault at least t times going from -flight 

to 

E'icft, where t is defined by there being k + 1 nodes between -Eieft an d bright) 
including both endpoints. This sums up to s + t faults. 

For FIFO, we postpone the discussion of the first s\ distinct pages seen in a 
phase. Just to avoid any confusion, note that these pages are immediately 
to the right of -Eieft (the endpoint of the previous phase) and thus not the 
pages that LRU faults on. After that, consider the maximal subsequence 
of at most k distinct pages. This subsequence starts with the (si + l)st 
distinct request (the last request to it before the S2 faults) and continues up 
to, but not including the first request that LRU has one of its S2 faults on. 
We know that there are at most k pages there, because LRU only faults s± 
times there. Assume that FIFO faults f\ times on this subsequence. Since 
FIFO is conservative, f\ < k. 

We define more such subsequences repeatedly, the (m — l)st of these ending 
just before LRU's first fault of the s m faults, and the mth including the s m 
faults and k of the k + t nodes before we reach E\ e ^. Finally, we return to the 
question of the first s\ distinct pages seen in the phase. These overlap with 
the u t pages" from the previous phase; otherwise we would not have started 
the phase where we did. If FIFO faults on one of these pages when going 
through the t pages in the previous phase, it will not fault on them again 
in this phase. Thus, we only have to count them in one phase, and choose 
to do this in the previous phase. In total, FIFO faults at most (E^a/i) + * 
times, and for alH, /j < k. 

The difference between the cost of FIFO and LRU is then at most (E^/j) + 
t-(s + t) = (E^i/i) - s = (^=i(fi - 1)) - - m). 

From the analysis of FIFO above, knowing that on a subsequence of length at 
most k, FIFO can fault at most once on any given page, if it faults /j times, 
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the subsequence has at least /j distinct pages. Given that the subsequence 
starts at the left end of the "sj pages" and ends at the right end of the "sj 
pages", all pages that FIFO faults on, except possibly the leftmost, must be 
requested at least twice, giving at least 2/j — 1 requests. So, the length of 
the sequence is at least (S^ 1 (2/ i — 1)) + 1. We now sum up over all phases, 
equipping each variable with a superscript denoting the phase number. 

First, the total length, L, is at least 

L > S,(S™\(2// - 1)) + V = ^tiVl) ~ ™ j + V. 

Since s expresses how far we move to the right and t how far we move to the 
left, and the whole path has a bounded number of nodes N, we have that 
ZjP > E jS i - N. Thus, L > (E^E^/f) - m> + s 3 ) - N. 

I has a number of complete phases and then some extra requests in addition 
to that. There must exist a fixed constant c independent of / such that the 
cost of FIFO on the extra part of any sequence is bounded by c. This follows 
since there is a limit of N on how far requests can move to the right. So 
if requests never again come so far to the left that LRU faults, all requests 
thereafter are to only k pages. This added constant can also take care of 
the initial extra cost of k — 1. Since we are just using a lower bound on the 
sequence length, we can ignore the length of a possibly incomplete phase at 
the end. Thus, 

c + SjSg^-l)-^-^) 
-iV + E J (E^ 1 2//)-mJ+si 

c + E.-Eg^//-!) 

c + EjW(ife - 1) 
-N + T,jm>2k 

c+{k- l)E i mJ 
-N + 2kT,jmi 

The second inequality follows since s 3 > m? , and the third inequality follows 
because < ^ and k > fi implies that - J ~- < 

For sequences where the number of phases does not approach infinity, as 
argued above, FIFO's cost will be bounded. For the number of phases 



FIFO (J) -LRU(J) 



< 



< 
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approaching infinity, linx,-^ C ^ + 2 k ^0 = = \ ~ h> wnich implies the 
result. 

Now, for this proof, we assumed that Si < k. If Si > k, we simply terminate 
the phase after the processing of the Si requests that LRU faults on, and 
continue to define phases inductively from there. All the bounds from above 
hold with t = and the observation that FIFO will not fault on the first 
s\ requests in the next phase. The direction of the construction is now 
reversed. In this process, whenever we reverse the direction as above, we 
also rename the variable s to t and t to s, such that s continues to keep track 
of movement to the right and t of movement to the left, and the inequality 
T,jt j > Y, jS j - N still holds. □ 

Lemma 7 For the path access graph Pjy, 

Mslx Pn (FIFO, LRU) = ~ - — -. 

2 2,n 

Proof The upper bound was shown in Lemma [6l Consider the family 
of sequences I n = (1, 2, . . . , k, k + 1, k, k — 1, . . . , 2) n . In each iteration, 
except the first, LRU faults twice (on pages 1 and k + 1), whereas FIFO 
faults on pages 1 through k + 1 in every iteration. So on this family, 
lim n _ >00 FIFO ^"|^~ ^ RU ( J " ) = h^-k = i _ ^_ ) so the maximum must be at 
least that large. □ 



Since LRU is optimal on paths, this gives : 

Theorem 4 1 Pn [FIFO, LRU] = [0, \ - and LRU dominates FIFO on 
paths. 

Note that FAR and LRU perform identically on paths, so FAR also domi- 
nates FIFO with the same interval. 



5 Star Graphs 

We let Sn denote a star graph with N vertices. A star graph has a central 
vertex, s, which is directly connected to iV — 1 other vertices, none of which 
are directly connected. Thus, we could also see a star graph as a tree with 
root s and N — 1 leaves, all located at a distance one from the root. We 
assume that N > k + 1, since otherwise, results become trivial. 
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Lemma 8 For the star access graph Sn, 

— + -rr 1 — r < Min 5jv (FIFO , LRU) < — + — — — r + — jt- r 

2 2(fe — 1) — v ' ; ~ 2 2(fc - 1) 2k(k - 1) 

Proof Consider an arbitrary sequence I respecting the star access graph, 
and consider its division into ^-phases. Since the central vertex occurs after 
each request to a leaf, each /c-phase, except the last, must contain requests to 
k — 1 different leaves, and must be of length at least 2(k — 1). As in the proof 
of Lemma HI FIFO faults at least once for each of these phases. LRU faults 
only on the leaves and only once on each, so it faults at most k — 1 times 
for each phase. Thus, if I has b phases, not counting the first empty phase,, 
|J| > 2(fc-l)(6-l) + l andFIFO(/)-LRU(7) > (b-l)-(k-l)(b-l)-k = 
-(k - 2){b - 1) - k, and so Min 5jv (FIFO, LRU) > = ~\ + m=Tj- 

We will show that the upper bound on Min 5jv (FIFO, LRU) comes very close 
to this by analyzing the following sequence. 

In = (P, J n ), J = Bi, . . . , jBjfc_l 

P = (1, s, 2, s, . . . s, k — 2, s, k — 1, s, k — 2, s, . . . s, 2, s, 1, s) 
Bi = {k, s, k — 1, s, . . . , s, 1, s), for 1 < i < k — 1 

We note that /c does not appear in P and that all the Bi are identical (we use 
the index for reference). Each \Bi\ = 2k, so \I n \ = 2(2k — 3) + 2k(k — l)n. 
LRU starts B\ with a fault on the request to k, thereby evicting k — 1. 
It then faults on k — 1 and evicts k — 2. This repeats and ends with the 
eviction of k at the request to 1 such that k — 1 is the least recently used 
page. Thus, it faults everywhere except on the central vertex s, which is 
never evicted by LRU. Since LRU's cache configuration — content as well as 
the relative ordering of the recency of pages — is the same at the end of B\ 
as it was at the end of P, the same pattern must be repeated in each Bi. 
Thus, LRU(I n ) = k + {k- l)kn. 

FIFO has three faults in B\\ On the request to k, where 1 is evicted, and at 
the last two requests of B\. So FIFO ends B\ with 2 being outside its cache. 
From there onwards, FIFO faults exactly once in each Bi, 2 < i < k — 1, 
at the request to i, on which it evicts % + 1. Therefore, FIFO ends each J 
with k outside its cache and, hence, the above described fault and eviction 
pattern is repeated in every J. This gives the cost FIFO(/ n ) = k + (k + l)n, 
and limn^oo FIFQ ( J "^-^ RU ( 7 ") e q Ua ls 

k + (k + l)n - (k + (k - l)kn) 1 k + 1 

hm = 1 

rwoo 2(2k - 3) + 2k(k - l)n 2 2k(k-l) 
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Thus, Min 5 - (FIFO, LRU) < -\ + = -\ + ^ + □ 



Lemma 9 For the star access graph Sn, 



Max 5jv (FIFO, LRU) = - + 



8k -12 



Proof We give a sequence respecting Sn for iV > k + 1 giving rise to the 
stated ratio. Let 

/„ = (P,B n ), where P = (1, s, 2, s, . . . , s, k - 2, s, k - 1, s) and 5 is 



fc-2, a, 

fc - 4, s, 

k, s, 

k — 1, s, 



. 5, 2, 5; 1) Sj k ; 5 ; 1; Sj 2, Sj 

. 5 ? 1) Sj kj Sj k 1, kj Sj 1 ; 5j 

. 5 ? 5 ? k 1^ k 2. s ; /t s /c , 5, 



• s, 4, s, 
■ s j 3, s, 



3, 
2, 



2, 

1, 



3. 

2, 



s, 4, s, 
s, 3, s, 



. s, k — 2, s 

. Sj k 3 5 s 

. s, k — 4, s 

. s j kj s 

. s, k — 1, s 



Writing the sequence B like this is just to give an overview. The sequence 
is the concatenation of all the rows from top to bottom. 

The column in bold indicates the requests that are faults for LRU. LRU 
faults on exactly one request in every row and so we have LRU(/ n ) = k + kn. 
FIFO faults on k distinct pages in each row, starting with the request at 
which LRU faults. Thus, FIFO(/„) = k + k 2 n. Furthermore, \I n \ = 2(k - 
1) + (4k - 6)kn. Since 



lim 

n— >co 



FIF0 5jv (J n ) - LRU Sjv (J„) 



k + k 2 n — (k + kn) k — 1 



lim 

n^oo 2(k - 1) + {4k - 6)kn 4k - 6 ' 



we have that Max Sjv (FIFO, LRU) > ^ = \ + fCT- 

To prove a tight upper bound on Max Sjv (FIFO, LRU), we consider an ar- 
bitrary sequence /. We can assume without loss of generality that / does 
not contain any consecutive requests to the same page as they only result 
in hits for both algorithms, while increasing the length of the sequence. 

We view I as a partition of fc-blocks with respect to FIFO, denoted by 
B\, ... , B n , ignoring the first empty block. Since both FIFO are LRU are 
conservative, each block, excluding perhaps the last one, must have requests 
to at least k distinct pages. The access graph is a star, so each request must 
be followed by a request to s. The number of faults incurred by LRU in £>j 
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is denoted by a^, where a\ = k. From the maximality of the blocks Bi, each 
block must have at least one fault for LRU. Since s is never evicted from 
the cache by LRU, we have 1 < cnj < k — 1. 

We now find a lower bound on the length of B(. First recall that FIFO faults 
on k — 1 leaf requests. We now establish some hits by FIFO. Consider a leaf 
request r that is a fault for FIFO, but a hit for LRU. Since it is not a fault 
for LRU, there must have been a request r' to the same page in the last 
k — 1 distinct page requests. If r' were a fault for FIFO, then r would have 
to be a hit. Since it is not, r' must be a hit for FIFO. Since LRU incurs on 
faults in Bi, there are at least k — 1 — ctj distinct leaf requests where LRU 
has a hit while FIFO faults, ensuring at least k — 1 — ai distinct hits for 
FIFO. Note that even though the hit we establish for FIFO could be in the 
previous block, -Bj_i, it cannot be counted twice, since there are no more 
faults on that page after r' in B^\. 

The faults and the hits, together with the requests to s following each of 
them, gives us at least 2{k — 1) + 2(k — 1 — on) requests. Since the terms not 
involving n disappear in the limit, 



Max 5jv (FIFO, LRU) < max 



E"=2 k-ai 



2> 
£*,•>! 



Er= 2 1 (4fe-4-2a i ) 



This is maximized for a« = 1 for 2 < i < n. Hence, M&x Sn (FIFO, LRU) < 
fc-i 



4/c-6' 



□ 



The algorithms FAR and LRU behave identically on star graphs. Neither 
of them ever evicts the central vertex. We state the result for both LRU 
and FAR in the main theorem, though FAR is not directly mentioned in the 
lemmas and proofs. 



Theorem 5 For the star access graph Sn and A € {LRU, FAR}, 

C 1 Sn [FIFO,A] 

l I 1 ■ i I _l i 



1 4- . 1 I -4- 1 

2 2(fe-l) ' 4 8fc-12 



2 ^ 2(fc-l) ~ 2fc(fc-l)' 4 ~ 8fc-12 

Proof This follows directly from Lemmas [8] and E3 □ 

In [13], it was shown that Max(FIFO, LRU) > = \~ 3^2- The above 
result shows that for star access graphs, that bound can be decreased by a 
factor of approximately two. 
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Since LRU and FAR perform identically on stars, Min 5jv (FAR, LRU) = 
Max Sjv (FAR,LRU) = 0. 

The star access graph is another example of where FWF performs poorly 
compared with the other algorithms. 

Lemma 10 For the star access graph Sn, and B G {LRU, FIFO}, 

Max 5jv (FWF,£) < -. 

Proof Given any sequence I in Sjy, it can be viewed a partition of k- 
phases. Since it is a star, each phase must be of length at least 2{k — 1) and 
by Lemma [US must incur at least one fault in each phase. Since FWF can 
incur at most k faults in each phase, if there are n complete phases in I n , 
then FWF(g~g(J) < 0^ = i. Hence, Max 5 * (FWF, B) < \. □ 

Theorem 6 For the star access graph Sn, and A £ {LRU, FAR, FIFO}, 

X s n [FWF, A] = 

Proof By Lemma 

Min 5jv (FWF,LRU) = Min 5jv (FWF, FIFO) = 0. 

Furthermore, since LRU and FAR perform identically on star graphs, we 
also have that Min 5 * (FWF, FAR) = 0. 

Given any sequence I respecting 5* n, it can be viewed a partition of k- 
phases. Since Sn is a star, each phase must be of length at least 2(k — 1), 
and A must incur at least one fault in each phase. Since FWF can incur 
at most k faults in each phase, lim n _ 5 . 00 FWF (^)~-^( J ) < 2 ^~_}^ = |. Hence, 

Max Sjv (FWF,^4) < \. 

Consider the sequence I n = {P, (B\, -E>2) n }, where P = (1, s, 2, s, . . . , s, k — 
2,s,k - l,s), 

B>\ = {k, s, k — 1, s, . . . , s, 2, s), and P>2 = (1, s, 2, s, . . . , s, k — 1, s) 

P>i and B>2 have requests to k distinct pages, excluding 1 and k, respectively. 

LRU faults on the first request in each B>i. FWF flushes its cache at the 
start of each Bi. So \I n \ = 4(k - l)n + 2{k - l),LRU(/ n ) = 2n + k 
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and FWF(J n ) = 2kn + k. So lim^ FWF(/ n )-LRU(/ B ) = |g) 
Max Sjv (FWF,LRU) > ± 

Let I n = {P, B n ) where P = (1, s, 2, s, . . . , s, k — 2, s, k — 1, s) and 



and 



D 



k, s, k — 1 

1, s, 

2, a, 1 



fe- 2, s 
k — 1, s, 



A- 



s 
s 



The ith row is i-free. Hence, each row is of length 2{k — 1) and \I n \ = 
2(k — 1) + 2{k — l)kn. Since FIFO only faults on the first request in each 
row, FIFO(B) = k and FIF0(7 n ) = kn + k. Since FWF flushes its cache at 
the start of each row, it incurs k faults in each row. Therefore, FWF (5) = k 2 
andFWF(/ n ) = k 2 n+k. Therefore, lim n 



FWF(J n )-FIFO(J n ) 



and Max Sjv (FWF, FIFO) 



fc(fc-i) _ i 

2(fc-l)fc — 2 



> 



2" 



5at, by Lemma [TOJ we get Max 5jv (FWF, A) 



Since FAR and LRU behave identically on 



□ 



6 Cycle Graphs 

We consider graphs consisting of exactly one cycle, containing iV vertices. 
We assume that N > k + 1, since otherwise, results become trivial, and 
define r = N — k. We concentrate on the case where r < k, since otherwise 
the cycle is so large that for the algorithms considered here, it works as 
if it were an infinite path. Thus, for example, there are sequences where 
FIFO performs worse than LRU, but on worst case sequences, simply going 
around the cycle, the algorithms perform identically. In this section, it is 
convenient to work modulo iV when indexing pages on the cycle. Thus, if 
p < 1 or p > N, we let p denote the page p — 1( mod N) + 1. We will not 
mention this again later in the proofs to follow. 

The following sequences were used in |13|. Theorem 7] to show that [— 1 + 
I m = (P, B m ), where P = (1, 2, . . . , k - 1, k), and B is 
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k-1 k-2 ■■■ 2 1 

k-2 k-3 ■■■ 1 k + 1 

k-3 k-4 ••• k + 1 k 

k k- 1 ••• 3 2 



k+1 1 2 ••• fc-1 

k k + 1 1 ■■■ k-2 

k-1 k k + 1 ■■■ k-3 

1 2 3 ••• ife 



(P,B M ), where P= (1, 2, . . . , A; - 1, Jfe, Jfe - 1, . . . , 1), and 



B 



k+1 k k-1 

1 k + 1 k 

2 1 jfe + 1 

k-1 k-2 k-3 

k k-1 k-2 



3 2 

4 3 

5 4 

k k + 1 
2 1 



These sequences respect Ck+i, the cycle access graph on k + 1 vertices. 
Hence, that bound is applicable to cycles of length k + 1 as well. 



Proposition 2 For the cycle access graph Ck+i, 

1 1 



1°*+! [FIFO, LRU] 



-1 + 



1 



k 2 4/c 



Proof This follows from the results in [13], using the sequences above which 
respect the cycle, and Lemma [U □ 

We now generalize these results to values of N = k + r, where 1 < r < k — 1. 

Lemma 11 For the cycle access graph Cm, 

Min Cjv (FIFO, LRU) < -1 + £ and M\tl Cn (FIFO, FWF) < -1 + £ 

P, P"), where P = (1, 2, . . . , fc, . . . iV, 1, 2, . . . , r - 1) 



Proof We define J n 
and B is defined by 



B 



r r — 1 

2r 2r - 1 

3r 3r — 1 

N N — 1 



1 

r+1 
2r + l 

k + 1 



N N — 1 

r r — 1 

2r 2r - 1 

k k-1 



2r + 2 2r + l 

3r + 2 3r + l 

4r + 2 4r + 1 

r+2 r+1 
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The vertical line is merely for reference in the proof. 

Let R denote the number of rows in B. (Note that R = LCM ( N > r ) ; where 
LCM(N ,r) denotes the least common multiple of N and r.) There are 
r columns before and k — r columns after the vertical line. Thus, \J n \ = 
N + r - 1 + kRn. 

Observe that the sequence turns exactly once, namely after the first request 
in B. There are k — 1 hits following that request for both FIFO and LRU. 
After that, the sequence moves around the cycle, so LRU faults on all of these 
requests, giving a total cost of LRU Cjv (J n ) = N + r - k + kRn. Note that 
FWF faults on the same requests as LRU, so FWF Cjv (J n ) = LRU Cjv (J„). 

For FIFO, when processing (k + 1, . . . , N) in P, it evicts {1, . . . , r}, and then 
when processing (1, 2, . . . , r — 1), it evicts {r + 1, . . . , 2r — 1}. Then, at the 
very first request of B, it incurs the next fault and evicts 2r. After that, the 
set of pages outside its cache is {r+1, . . . , 2r}, and FIFO does not fault again 
in the first row of B. FIFO then faults on the first r requests in the second 
row, evicting {2r + 1, . . . ,3r}. This pattern continues, so FIFO only faults 
on the first r entries in each row of B. Therefore, FIFO c ' JV (J n ) = N + rRn. 

This gives 

Mi„^ (FIFO, LRU) < Urn "FO^ (J„) - LRU^( J„) 

n->oo \J n \ 

N + rRn - (N + r - k + kRn) 



lim 



N + r - 1 + kRn 



k — r r 
— = ~ l + k- 



Lemma 12 For the cycle access graph CV, 



□ 



Max^ (FIFO, LRU) > - 



2 4fc-2 
Proof Let I n = (So, S±, S n ), where 

Si = (i + k,i + k - 1,... ,i + 2,i + l,i + 2, ... ,i + k - l,i + A;). 

Clearly, FIFO(5 ) = LRU(S ) = k. 

In processing Si, LRU only faults on 1 + k, where it evicts 1, which is not 
requested in Si. In general, LRU faults only on the first request in each Si, 
evicting page i, which is not requested in Si. Hence, LRU(/ n ) = k + n. 
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FIFO faults on the first request in S\, evicting k, which is requested next. 
At that request k — 1 is evicted, leading to a fault on the following request, 
etc. In total, FIFO faults k times on Si and pages were brought into cache 
in the ordering i + k through i + Thus, in general, when the processing of 
Si + i starts, the situation repeats. Hence, we have FIFO(J n ) = k + kn. The 
length of the sequence is \I n \ = (2k — l)(n + 1). So, 

Ma*c» (FIFO, LRU) > li m FIFO(/„) - LRU(/„) 

n— Kx) J n 

k + kn — (k + n) 
= hm — — — r- 

n^oc (2k - l)(n + 1) 

_ k- 1 _ 1 1 
~2k - 1 ~2~ 4k -2 



□ 



Theorem 7 For the cycle access graph Cn, 
r 1 1 



-1 + 



k 2 Ak-2 



C Z Cjv [FIFO, LRU] C 



-1 + 



1 1 



1 



k'2 Ak-2 



Proof The left-most containment follows from Lemmas [TT] and [T2l and the 
right-most from Lemma HI □ 



Theorem 8 For the cycle access graph Cn, 



I Cjv [FWF,LRU] 



Proof Sequence, I n = (1,2, ... ,k,k + l,k, ... , 2) n , respecting Cn, gives 
the right endpoint in conjunction with Proposition [TJ The left endpoint is 
given by Lemma [5j □ 



The exact results to be presented sometimes depend on the relationship 
between k and N, e.g., whether or not r divides N (denoted r | N). To 
express many of the results, we need the following term that, for brevity, we 
will simply denote X r : 



X r = r(x - 1) + 



N 
2 X 



, where x 



i N 
log — 

r 



In the following lemma, we analyze FAR's behavior on the simplest sequence 
exploiting the cycle structure. 
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Lemma 13 For FAR and the sequence I n = (1,2, ... ,k, ... , N) n in Cn, 
each /c-phase, except the first and possibly the last, has X r faults, and 



nN 



X r + k-X r < FAR Cn (I n ) < 



nN 



X r + k-l. 



Proof In the given sequence, as in any other sequence, the first /c-phase 
contributes k faults. The first phase change in I n occurs at k + 1, at which 
all the other N — 1 pages are unmarked. Given that the sequence goes 
around the cycle n times, without turning, the properties discussed about 
faults in the second phase holds for all subsequent ones, with the possible 
exception of the last which may contain just one fault. Consider the fault 
incurred at the phase change at k + 1. The page evicted lies in the middle 
of the unmarked segment [k + 2, . . . , N, 1, . . . , k]. Following this, there are 
r — 1 more faults before the next hit. Each fault leads to the eviction of the 
page adjacent to the most recently evicted page, the evictions moving in the 
same direction in which the faults are encountered. 

In each phase, we refer to the first r faults as the first batch, faults numbered 
r + 1 through 2r as the second batch, and so on. If there are i batches of 
faults in one /c-phase, then the first i — 1 batches will contribute r faults 
each, and the last batch will have at least one and at most r faults. For the 
ith batch, we denote the length of the unmarked segment after marking the 
first page in the batch by di, and the distance to the page evicted at the first 
fault in the ith batch by Di. These distances are measured in the direction 
in which the faulting page was approached. Therefore, d\ = N — 1 and for 
i > 1, di + i = di — Di. Since LRU is used to break ties, if for some i, di is 
even, then the closer of the two midpoints is evicted at the first fault of the 
ith batch. Thus, we have the following dependencies: 

For i > 1, Di = \di/2] and d i+1 = di - D, L = [di/2\ 

From the recurrence di = |_dj-i/2j, we obtain the following relation: 



di-i 




1 


di-2 






di-2 




di 




N- 1 


2 




2 


2 






2 2 




2 i-i 




2 i-l 



A /c-phase ends when all the pages in the cache are marked and the next 
request will be a fault. At any given instant, the marked segment is a 
path in Cat. This implies that a phase ends when the r pages outside the 
cache constitute the unmarked segment, and one of those unmarked pages 
is requested. Therefore, if there are i batches in a /c-phase, then di + 1 < 2r. 
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Stated differently, the smallest value of i for which d{ + 1 < 2r gives the 
number of batches in a phase. 

If there is an i such that di+1 = 2r, then the phase has i batches contributing 
r faults each. Otherwise, if di + 1 < 2r, then the first i — 1 batches contribute 
r faults each and the last batch contributes fewer than r. 

It follows from the above that d{ + 1 = [t^t] • Solving C^r] < 2r gives 
i — l= [log batches with r faults each and the last with y = [t^t] ~~ r 
faults. Therefore, each phase in /„, excluding the first and perhaps the last, 
contains r(i — 1) + y faults. There are |_^-J complete phases in I r ^ n and 
if the last phase is not complete, that is, k \ nN, then the last phase can 
contain at most r(i — 1) +y — 1 faults. Thus, we obtain the following relation 
for FAR serving I n : 



nN 



(rx + y)+c< FAR Cjv (7 n ) < 



nN 
~k~ 



(rx + y) + rx + y - 1 + c, 



where x = [log ^-J , y 



\ 2*1 



r and c = k — (rx + y). 



□ 



The following lemma analyzes FAR's behavior on a cycle when the cycle 
structure is not used. Thus, the cycle access graph is used as a path access 
graph. However, FAR is oblivious to this and uses distances involving the 
non-utilized edge in the graph, leading to non-optimal results. 

From now on, whenever needed , we use N to denote N, if N is even, and 
N — 1, otherwise. 



Lemma 14 For FAR and the sequence /„ = (1, 2, . . . , k, . . . , N — 1, N, N — 
1, . . . ,2) n in Cat, each /c-phase, except the first (which has k) and the last 

(which has r), has rx + y faults, where x = log ^ and y = 



r. 



Proof The first /c-phase in has k faults. In any /c-phase of I n , excluding 
the first, the first set of r faults is called the first batch, faults numbered 
r + 1 through 2r is called the second batch, and so on. If there are i batches 
of faults in one /c-phase, then the first i — 1 batches will contribute r faults 
each, and the last batch will have at least one and at most r faults. 

As before, the length of the unmarked segment after marking the first page 
of the ith batch is denoted by di and the page located Di pages away is 
evicted at that fault. All these distances are measured in the direction in 
which the first fault of the batch was encountered. Note that within each 



26 



iteration within I n , there are two phase changes, occurring first at k + 1 and 
then at r. In the following discussion, we explain the behavior of FAR in 
one iteration within I n . Since the same properties hold for others, that will 
lead to a bound for FAR° N (I n ). 

At the end of a phase and right before the start of the next, FAR's cache 
is connected. Hence, the r pages outside the cache also form a connected 
component, implying that the sets of pages outside FAR's cache immediately 
before the phase changes at k+1 and r are {k+1, . . . , A^} and {r, r— 1, . . . , 1}, 
respectively. 

For the phase changes at k + 1 and r, the faulting request is approached 
from k and r + 1, respectively. For either case, we have d\ = N — 1 and as 
in Lemma [131 the page located D\ = \d\/2\ vertices away is evicted at the 
first fault in the phase. The next r — 1 faults lead to eviction of pages in the 
same direction in which the faults are encountered. Unlike in the previous 
lemma, the sequence considered here turns back at the end of the first batch 
and so the second batch of faults start at the most recently evicted page. 

Phase change at k + 1: The first fault in the second batch occurs when the 
sequence reaches D±, which is also the first page marked in the batch. The 
unmarked segment at that instant is {D\, D\ — 1, . . . , 1}. 

Phase change at r: Analogously to the previous case, the second batch of 
faults starts when the sequence reaches N — D\ + 1 . The unmarked segment 
at that instant is 



N - D x + 2, N - Dt + 3, . . . , N - D x + r, . . . , k, k + 1, . . . , N - 1, N . 



In either case, the length of the unmarked segment is di = D\ — 1. Note that 
for both locations of phase change, the change in direction of the sequence 
right after the first batch affects the resolution of ties in subsequent batches. 
In fact, if c?2 is even, then the farther of the two midpoints, measured in 
the same direction as the fault, is less recently requested than the other. 
Therefore, for each phase, we have the following correspondence: 



Since, in either case, from the second batch onwards, the sequence does not 
change direction for the rest of the phase, all subsequent ties within the 
phase are resolved in the manner of the second batch. Therefore, in any 
given phase, from the second batch onwards, if the unmarked segment is 




dijl + 1, if d2 is even 
\d 2 /2], if d 2 is odd 
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even, the farther of the two midpoints, measured in the same direction in 
which the fault was approached is evicted in favor of the other. This yields 
the following set of relations: d\ = N — 1, Di = [di/2], cfe = D\ — 1, and 
for i > 2, 



and 



di/2 + 1, if di is even 
\di/2 \ , if di is odd 



H+l 



di - Di 



di/2 — 1, if di is even 
[di/2\, 



if di is odd 



This implies that for all i > 2, di + \ 



dj-l 
2 



We now establish the following claim. Recall that TV denotes N, if N is 
even, and N — 1, otherwise. 



Claim 1 For i > 3, we have <U + 1= [~iz\ 



N 



Proof Since D\ = [^f^] , using the new notation, Di = y. 
We proceed to show by induction that for i > 3, di + 1 = [^^J ■ 
For the base case, i = 3, we have 



da 



d 2 - 1 




(A - 1) - 1 






2 




2 




2 



- 1. 



Hence, d 3 + 1 = L^J ■ 

Now, we assume that the induction hypothesis holds up to some i > 3. For 
the induction step, we prove the relation dt+i = L^i^J > by applying the 
hypothesis for dt in the last equality below. 



d 



t+i 



dt-l 




2 





^{d t + l)-l 





1 








2 


2 t-2 





Therefore, + 1 = |_2&rJ > an d the claim is proved. 



□ 



As was the case in the previous lemma, the last batch starts when for the 
first time in the current phase, the length of the unmarked segment is no 
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greater than 2r, i.e., the smallest value i for which di + 1 < 2r gives the 



number of batches in the phase. Solving 



A 



< 2r gives i — 1 



log 



A' 



Therefore, the first i — 1 batches in a fc-phase have r faults each. In the last 



batch, though, there are exactly 



A 



r faults. 



Right before the start of the ith batch, the length of the unmarked segment 
is di + 1. The phase must end when the length of the unmarked segment 
becomes r. Therefore, di + 1 — r is an upper bound on the number of faults 
incurred in the ith batch. □ 



Note that in the above proof, making the sequence go only up to some other 
value between k + 1 and N — 1, instead of up to N, would never give more 
faults. 



Lemma 15 For 1 < r < k — 1, in any sequence respecting the cycle access 
graph Cat, the maximum number of faults incurred by FAR in a /c-phase, 
excluding the first, is at most X r . In particular, FAR incurs the maximum 
number of faults in a fe-phase if the sequence takes the shortest path between 
any two faults in that phase. Consequently, in C^+i, each A:-phase can 
generate at most [~log(/c + 1)] faults for FAR. 

Proof Given the eviction rule of FAR in Cat, which is that it evicts the 
midpoint of the current unmarked segment, it follows that when a sequence 
does not turn inside a phase, it is taking the shortest path to the next fault. 
This situation is analyzed in Lemma[13l When a sequence turns such that at 
least one page is marked before the next turn, then all those pages become 
unavailable for eviction for the remainder of the phase. A phase ends when 
all the pages in the cache are marked and a new phase starts at the next 
fault. Therefore, if a sequence keeps moving along the shortest path which 
takes it to the next fault, then it is also marking the fewest number of pages 
in order to get to the next fault, thereby, maximizing the number of faults 
FAR incurs in the current phase. Hence, the maximum number of faults 
incurred by FAR in each phase, excluding the first, is upper bounded by 
X r , as proved in Lemma [13j The special case of Ck+i is given by r = 1 and 
so the lemma is proved. □ 

Lemma 16 For the cycle access graph CV, and A G {LRU, FIFO, FWF}, 

Min Cjv (A, FAR) > - r ~ 1 . 
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Proof Consider an arbitrary sequence I n in Cjv, where n denotes the 
number of fc-phases in the sequence. The last phase of a sequence may 
contain fewer than k distinct pages and in that case we can ignore the last 
phase in I n . Note that each phase contains requests to k distinct pages. It 
follows that each phase in a sequence is of length at least k and A incurs 
at least one fault in each of them. By Lemma fT5j we know that FAR can 
incur at most X r faults in each phase, excluding the first. By Lemma [H A 
faults at least once in each phase. In the first phase both algorithms incur 
k faults. Thus, in each phase, the absolute value of the maximum difference 
in faults is at most X r - 1. Thus, lim n ^ 00 ■ 4(J " ) ~^ R(/n) > □ 



Lemma 17 For the cycle access graph Cat, 



Min Cjv (FIFO, FAR) < 



k 



Proof Recall the sequence J n from the proof of Lemma [TH 

J n = (P, B n ), where P = (1, 2, . . . , k, k + 1, . . . , N, 1, 2, . . . , r - 1) 

and 



B 



r r — 1 
2r 2r - 1 
3r 3r — 1 



N N -I 



1 

r + 1 
2r + 1 



fc + 1 k 



N N -I 
r r — 1 
2r 2r - 1 



2r + 2 2r + 1 
3r + 2 3r + 1 
4r + 2 4r + 1 



r + 2 r + 1 



\J n \ = kRn + N + r- land FIFO Cjv (J„) = + rRn. 

There is exactly one turn in J n , which occurs at the first request in B and 
nowhere else. For the rest of the sequence, it moves around the cycle without 
turning. Hence, the number of faults incurred by FAR in each phase of B, 
excluding the first two, is given by Lemma [T3l to be X r = r(x — l) + , 
where x = [log f\ . Therefore, FAR Cjv (J n ) = |_^J X r + c, where c is a 
constant. The constant bounds the number of faults in the first two phases. 
Now, Min^ (FIFO, FAR) is at most 



lim 

n— >oo 



YWO CN (J n ) - ¥KR CN (J n ) 
\Jn\ 



nR(r — X r 



lim 

ra-»oo nRk + N + r 



X r 



□ 
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Lemma 18 For the cycle access graph Cn, and A £ {LRU, FIFO, FWF}, 

Max° N (A, FAR) > 1 - — . 

k 

Proof Consider the sequences I n = (1,2, . . . , N) n m C ^ such that k divides 
nN. It is easy to see that A(I n ) = \I n \ = Nn. By Lemma [T3l we have 
YAR CN (I n ) < ^X r + k-l. Thus, Max Cjv (A, FAR) is at least 



Hm ^(J n )-FAR^(/ W ) > Hm nN - rf Xr 



nN 



k 



Lemma 19 For the cycle access graph Cn, 



□ 



Mvo Cn (LRU, FAR) < 



r ^ 


logf 


-0 




N — 


1 



where N is N and N — 1 \{ N \s even and odd, respectively. 

Proof Consider the sequence I n = (1, 2, . . . , N — 1, N,N — 1, . . . , 2) n used 
in the proof of Lemma [141 Clearly, LRU c ' iV (/ ra ) = 2nr + k — 1 and = 
2 (AT — l)n. There are two phase changes in each iteration of I n , so by 
Lemma [T4l 



k + 2n r 



log 



TV 



+ 



2* 



< FAR Cjv (/ n ) 



where x 



logf 



Now, since r < 



LRU c ^(/„) -FAR c ^(/„) r 
hm — — < 



logf 


-0 







JV- 1 



□ 



When r = 1, we get the bound Min Cfc+1 (LRU, FAR) < - [log ^ J 1 . 
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Theorem 9 For the cycle access graph Cat, 
X r — r X r 



k 



C 1 Cn [FIFO, FAR] C 



X r -l _ 1 



N-l 



C [LRU, FAR] C 



X r -1 1 



and 



C X ^ [FWF, FAR] C 



Proof The first relation follows from Proposition Q] and Lemmas [T6| [TTl 
and!18[ and the second from Proposition [1] and Lemmas I16 | 1191 and [TBI The 
third result follows from Proposition [T] and Lemmas [5] and [T8l □ 



7 Concluding Remarks 

Relative interval analysis has the advantage that it can separate algorithms 
properly when one algorithm is at least as good as another on every sequence 
and is better on some. This was reflected in the results concerning FWF 
which is dominated by the other algorithms considered for all access graphs. 
It was also reflected by the result showing that LRU and FAR have better 
performance than FIFO on paths. The analysis also found the expected re- 
sult that FAR, which is designed to perform well on access graphs, performs 
better than both LRU and FIFO on cycles. 

However, it is disappointing that the relative interval analysis of LRU and 
FIFO on stars and cycles found that FIFO had the better performance, 
confirming the original results by [13] on complete graphs. Clearly, the 
access graph technique cannot be arbitrarily applied to all quality measures 
for online algorithms to show that LRU is better than FIFO. To try to 
understand quality measures better, it would be interesting to determine on 
which the access graph technique is useful for this well studied problem and 
on which it is not. 
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